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ABSTRACT 



The need for tools capable of handling non-stationariiies in the spectral content of 
the data has been recognized as early as 1946. The Wigner-Ville Distribution (WD) has 
been extensively used since its introduction in 1948, but suffers from some associated 
problems (e.g., spectral cross-terms and requiring the use of analytic signals). An alter- 
native Distribution is proposed, which has its origin in the definition proposed by Page 
of "Instantaneous Power Spectrum" (IPS). Its characteristics are examined and, when 
pertinent, compared to the WD. It is shown to be less sensitive to the problems aflicting 
the W'D, but provides less frequency resolution. The usefulness of a parametric (AR) 
version is investigated. Some typical test signals are examined, to demonstrate the per- 
formance and trade-offs of IPS and its parametric version. 
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I. INTRODUCTION 



A. THE PROBLEM 

The use of the Fourier Transform as a spectral description of signals is a concept 
whose usefulness is restricted to the class of stationar>' signals. Though mathematically 
elegant and convenient, Fourier decomposition of a signal can often mask the true 
spectrum, since the assumed basis-functions implicitly oppose any notion of time- 
dependency. 

When the spectral content of the signal changes with time, as is often the case in 
fields such as Communications. Seismology or Speech Processing, more powerful tools 
are needed. The time-dependency of the spectral content should be apparent and meas- 
urable through the use of a more general type of representation of the signal. Gabor 
(Ref 1] proposed such a representation by introducing the concept of a spectral de- 
scription in a joint time-frequency plane. 

Since then, several attempts have been made [Refs. 2, 3, 4, 5] to derive a, or the. 
function capable of correctly describing the distribution of the signal's energy in this 
plane. Short Time Fourier .Analysis (STFA) has been widely used and is regarded as a 
very convenient approximation to the “true” distribution. 

However, obtaining this distribution is an under-determined problem, since an in- 
finity of valid solutions is introduced by allowing non-stationary descriptions. For 
example, any time signal can be represented as a time-varying DC component. To 
illustrate, consider the following discrete time signal: 



Two equally valid representations, in the sense that each one is capable of generat- 
ing the observed signal, are shown in Figures 1 and 2. 

Each one of these descriptions explains the existence of a sinusoidal time-series. 
Though there is no conceptual distinction between them, the algorithm of analysis must 
have the ability to decide which one will be taken as the “true" one, based on an ex- 
plicitly or implicitly built-in set of rules. The situation thus arises, where “one will see 
what one wants to see". 




( 1 ) 



1 





Further complications appear when we consider the fact that, given a time-series, 
we cannot deterinine if the underlying process is or is not stationary, without using ar- 
bitrary and often inappropriate assumptions about what is the local behavior. When the 
algorithm of analysis produces a time-varying spectrum, the fact remains that there is 
at least one equally valid time-invariant spectral description (the Fourier Transform 
based), which is arbitrarily ruled out. 

B. APPROACHES 

If the signal is to be represented in the joint time-frequency plane in a sensible 
manner, the distribution must have some key properties. For example, a shift in the 
time-series should always imply a corresponding shift of the spectral representation 
along the time-axis. Also, a multiplication of the time-series by a complex exponential 
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should result into a shift of the spectral representation along the frequency axis. Without 
these properties, physical interpretation of the representation can be an impossible goal. 
This is one of the reasons why Ambiguity functions are not used in spectral estimation. 

This technique of imposing constraints that are felt needed in a distribution can be 
carried further, and the distribution (or one of the distributions) formed in this refine- 
ment process defined as the true distribution. One of the most successful representations 
obtained by this approach is the Wigncr-Ville Distribution (WD), which has extensively 
been used since its introduction in 1948 [Ref 2]. 

A different approach to solve the indeterminacy is to define, a priori and unambig- 
uously, what the considered true distribution is. Though this approach lacks control of 
the properties of the resulting distribution, it has the advantage of more closely pre- 
serving physical meaning in whatever results it produces. A typical example is the 
definition of "Instantaneous Power Spectrum" proposed by Page [Ref 5], uniquely 
determining a resulting distribution which is amenable to physical interpretation. 

C. OBJECTIVES 

The Wigner-Ville Distribution has been extensively studied [Refs. 6, 7, 8. 9 ]. and its 
characteristics are fairly well understood. However, its use is still hampered by some 
problems. First, it has uneven performance for different classes of signals. It performs 
optimally for single-component linear F.VI (linear chirp), but has worse performance for 
less regular spectral dynamics. Also, when multi-component signals are present, the WD 
creates artifacts in the spectrum, lying mid-way between true components. These 
artifacts are. up to some degree, recognizable and treatable due to an alternating sign 
[Ref 6]. but can severely mask results when analyzing more complex signals. The use 
of analytic signals is usually required, not only to avoid the need for sampling at twice 
the Nyquist rate, but also to prevent the appearance of undesirable artifacts that would 
otherwise be created by the interference between positive and negative frequencies 
[Refs. 6, 7. 10]. 

Though it has not received proper attention in the literature. Page's Instantaneous 
Power Spectrum is. once properly understood and treated, a practical alternative to es- 
timate the time-varying spectrum. 

The study and development of such an alternative is the main goal of this thesis. 
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II. INSTANTANEOUS POWER SPECTRUM (IPS) 



A. DEFINITION 

In an attempt to accommodate the notion of instantaneous frequency content, Page 
defined the Instantaneous Power Spectrum as the derivative of a running energy spec- 
trum [Ref 5]: 






( 2 ) 



where 



•sr(/) = 



s(t)c . 



( 3 ) 



That is, the Instantaneous Power Spectrum was defined as being, at each frequency, 
the rate of change of the energy collected by a Fourier transform taken from — oo . up 
to the time of analysis. This concept was later extended by Levin [Ref 3]. who intro- 
duced a complementary backward run, similarly defined as: 



where 



•s;(/)= 



s{x)e 



( 5 ) 



and defining the Instantaneous Power Spectrum as the average of these two runs: 



ips{t,f) = ^[p-{t,f + p\t, y)]. 



(6) 
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This expression can be put in the form (Refs. 3, 1 1 ( 

/PS(r,y)=Real[s*WS(y)e'''"-^'] (7) 

where S(f) is the Fourier Transform of i(r). 

IPS can thus be seen to be the real part of Rihaczek's distribution [Ref 4], which 
allows one to think of lPS(i,f) as being the energy density entering, at time i, an infi- 
nitely narrow filter centered at frequency / An enlightening treatment of this point can 
be found in Ackroyd [Ref 12|. 

1. Time Domain 

Let us now consider the following: we want a function of i, t whose Fourier 
Transform is IPS{i,J) . That is, we want G(/, t) such that 






oc 

G(r, x)e~P"^'(h . 



— OC 



( 8 ) 



Then, using (7) 



OO 



G(f. t) = ^ 






— OO 



from where, with the appropriate variable substitutions we obtain 



( 9 ) 






OO OO 







-n 


5(0 


s {[ — + 5 (r) 


5(r + 


1— 


-OO 


-OO 






(10) 



Interchanging the order of integration, we get 



G(/, t) = y - "r) + s\i)s{t + t)] . 



( 11 ) 
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IPS can now be restated as: 



IPS{i,f)=\ 



oo 

[ 5(^)5 (r — t) + 5 (r)5(r + t)] . 

%> 

—00 



( 12 ) 



2. Frequency domain 

Alternatively, a dual expression can be derived, expressing IPS in terms of the 
Fourier Transform of the signal. 



7PS(/,y) = ^ 



s{i)s*{r — r)e 



— >0 



5 {i)s{[ + z)e ^^“'^‘dz 



00 00 

r r 






S {y)e ‘ Vy e ^^ "^'dz + 



%J %J 

—00 —00 



J 

—00 



OC' 00 

r r 












\J \J 

—00 —00 



%) 

—00 



CO 00 

r r 



S{^l)S\y)e‘^'^' e ^^'^’d{y -f)dydn + 



t ) x ) 

—00 —00 



(13) 






00 DO 

r 



S\u)S{y)e e^^~^'d{y -f)dyd^ 



—00 —00 
00 






S{n)S{f)e-^^^-^-^’dn 



6 



With the appropriate variable substitutions, we get 



IPS{!, f) = -^ 



S{f+ + y 



%/ 

— oo 



S{f- 



(14) 



and, finally 



IPS{t,f) 



/* 



OO 




[S(/)S V- y) + S{J)S{f+ y)] . 



3. IPS for discrete signals 

The discrete version ofIPS follows directly from ( 12 ) as 



(15) 



lPS(n, 6) = — ^ [■s(/?)s [n — k) + s {n)s{n + A’)]c (16) 

A'= — DC 

A formal derivation of ( 16 ) can be found in Appendix A. 

B. RELATIONS WITH OTHER TIME-FREQUENCY DISTRIBUTIONS 
1. Rihaczek's Distribution 

The Rihaczek Distribution was proposed in 196S as the true energy' distribution 
in an attempt to unify existing results. 

Formally, the distribution is [Ref 4] 

e{i.^r) = s{i)S(J)e-P^P (17) 

where s(t) is the analytic signal being analyzed, and S{f) is its Fourier Transform. 

As was derived. £(/„. A) is the complex energy density at the point (to-A) i^> the 
time-frequency plane. A similar expression had been considered by Levin, who defined 
the Complex Insianianeous Power Spectrum as the complex conjugate of ( 17 ). 



* Rihaczek used the concept of “complex energy”, whose real part is the real energy, while the 
imaginaiy part is the reactit'e energy. 
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Using Levin's notation, the Complex Instantaneous Power Spectrum is 



A'(r,y) = 5"*(/)S(/)/ 



■2^/1 



( 18 ) 



where s(/) is the complex envelope of the real signal under analysis. 

We thus see from ( 7 ) that, though addressing a broader class of signals, IPS 
is both the real part of Rihaczek's Distribution and the real part of Levin's Complex 
Instantaneous Power Spectrum. Formally, 

/PS(/,y) = Real[£(r, y)] . (19) 

2. Cohen's Generalized Phase-Space Distribution Functions 

In 1966, Cohen introduced a generalized class of time-frequency signal repres- 
entations, given by 



OO OO OO 

r r r 



C{LJ) = 



— OO — OO — OO 



d>(o, t) 5(r, -t- )5*(/| — -y ) ^^dodt^dr 



( 20 ) 



where the choice of 0(u, t) will determine the resulting distribution [Ref 13]. 
If we slightly rearrange ( 20 ) to read 







0 




c(L J) = 

%) 


%) 


0(a, t) 


— «/ 



5(h + y )5*(h - y )e’-''-‘^dt^ 



e~^^''^''‘^"f^dodx , (21) 



— OO — OC- 



we can interpret it as the double Fourier Transform of 4'(o, t) [Ref 14]. where 



OO 



( 22 ) 



— OO 



T(l), t) = 0>(a, t) 



5(/,+-?-)5*(h-^)^^’^''Vr, 



We can now recognize ( 22 ) as a general expression for the well known Ambi- 
guity function, which can serve as a general definition of the time/ frequency 
autocorrelation function [Refs. 3, 4 ]. 
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Each choice of weighting function 0(u, t) in ( 20 ) provides a different distrib- 
ution, because each one defines the combined autocorrelation ( 22 ) in a somewhat dif- 
ferent way. 

Let us see which distributions result from three different realizations of 0(u, t) 
(complex exponential, constant, and real sinusoid) [Refs. 4, 13, 14 ]. 



a. Complex exponential 



C>(l), t) = C 2 



(23) 



The combined autocorrelation becomes 



OO 



4'(u, t) = 5(/i -I- y )5*(r, - y ) 



%) 

— OO 



■ (24) 



OO 




J 



and. defining a new variable / = r, 4- -y, we obtain 



OO 





The resulting distribution is 



OO OO OO 

r r r 



c{t, f) = 



s(ri)s (tj — e 



(26) 



J J 



— OO — OO — OO 
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oo oo 



c{l, J) = 



5(ri)5 (r, — r) e — [)dr^dT 



\} \J 

— OO — oo 



5(/)5 (/ — z)e 



= :s(/)S (/)e 






( 27 ) 



which is the well-known Rihaczek Distribution. 

Though this definition of the combined autocorrelation is probably the 
most intuitive one. its lack of mathematical symmetry has the undesirable effect of 
making the distribution complex. It has extensively been used in radar theory, since 
( 25 ) is exactly Woodward's definition of the Ambiguity function [Ref. 3]. 



b. Constant 



O(-), t) = 1 



(2S) 



The combined autocorrelation becomes 



OO 

r 



(a, r) = 



+ y )s\t^ - y ) . 



%) 

— OO 



The resulting distribution is 



(29) 



OO OO OO 

/%/%/% 



c(b J) = 



%J 

—oo 



•j(h + T )^*(^1 “ "T 



1, 



dtidxdo 



— OO — oo 



(30) 
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oo 






c{t.D = 



%} 

“OO 



oo 

5(/, + j- )s\i, - f - t)di,dz 

X) 

—OO 



oo 

r 



s{t + -y )5 ^ )e 



-nr^fd^ 



■oo 



(31) 



which is the Wigner-Ville Distribution. This definition of the combined autocorrelation, 
though only slightly diflerent from the one in ( 25 ), has the needed symmetr>' to make 
this distribution real, and mathematically very convenient (when dealing with analytic 
signals). 



c. Cosine function 



O(o, t) = cos(;ri)T) 
The combined autocorrelation becomes 



(32) 



oc 



T(t», t) = cos(;:i)t) 



5(/, + )5*(r, - Ar )f^~''’'dt^ 



“OO 



+ e ■'"^■) 



sih + T “ "T 



%J 

“OO 






+ -T )^*(fi - -f 2 V/, 



I-*; 

— oo 



+ 



oo 

r 



sii,+j-)s\t,-j-)e’-^^‘^-T^di, 



%} 

“OO 



(33) 
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Substituting i for in the first integral, and t for t, — in the second, 



we get 









— OO 






(34) 



[s(/)s (t — t) + 5 (/)s(t + . 



J 

— OO 



The resulting distribution is 



cO- J) = 



OO OO OO 

r r r 



T (/] — t) + s (/i)s(ti + 



J 

— OO — OO — OO 



OO OO 

/» /% 



—OO 



y [5(t,)s‘(t, - t) + 5‘(h)5(h + '^)]<? 

— OO 



(35) 



1 



OO 

[s(/)s*(/ — t) + s*{t)s{t + T)\e~-^^~'-^dT 



— OO 



which is the expression we had found earlier for IPS. In this case, the definition of the 
combined autocorrelation also possesses the symmetry needed for a real distribution. 
The kernel that generates IPS ( (I>(a, t) = cos(7ruT) ) has been considered by Cohen 
[Ref. 13), and shown to generate the Margeneau-Hill distribution [Ref 15], well known 
in quantum mechanics theor>\ 

3. Wigner-Ville Distribution 

Since the Wigner-Ville Distribution (WD) has enjoyed wide acceptance, we will 
define more closely the relations between IPS and WD. As an indirect connection, we 
can relate both to the Rihaczek Distribution. From ( 29 ) the WD implicitly defines the 
combined autocorrelation as [Ref 4] 
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t) = 



OO 

/% 



*/ 

“OO 






OO 




%/ 

“OO 






(36) 



Hence, the Wigner-Ville Distribution is given by [Ref. 4] 



OO 







“1 




“/-t'T 

e 

- 


•5(/i)5*(/i - 



“OO 



(37) 



where F,,.[ . ] denotes the 2-D Fourier transform. From the convolution property of this 
operator, and using ( 25 ), 



]VD{t,J) = F^,[e 




OO 

5(/,)/(/i-T)e^'"^Vt, 



“OO 



= ** £(/, J) 



(3S) 



where ** stands for 2-D convolution, and c{i, f) is the Rihaczek Distribution. From 
( 38 ), using ( 19 ) and the realness property of WD (see Section 11-C), we obtain 



irD(/, y) = Real[e''"’^-^‘^** t{i, y)] 
7PS(/,y) = Real[,5(/,y)**£(r, y)] . 



(39) 



We thus see how the two distributions can be obtained from the Rihaczek Distribution 
by an appropriate choice of the function convolved with t[t, f) . 
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We found earlier that each choice of ^(o, x) in Cohen's class of Distributions 
( 20 ) yields a particular definition of the combined autocorrelation function 



OO 

T(u, t) = 0(1), x) 5(r, + ^ )5*(r, - ^ (40) 

%/ 

— OO 



and that the 2-D Fourier transform of ( 40 ) resulted in a particular distribution, de- 
pending on 0(i), x). 

We see now that ( 40 ) is the product of two functions, O(o, x) and the integral. 
Its double Fourier Transform is thus a double convolution in the transform 
domain. [Ref 14] 

That is: 









0(u, x) 



+ -T ~ 'T 



= F,.,[0(u, X)] 



OO 



(41) 






— OO 



= F,,,[cb(o,x)] ** ll 'D{t,J). 



In the particular case of IPS, 

1PS{1, j) = ,[ cos(tox)] ** WD{i, J ) . (42) 

Therefore. 

1PS{1, f) = cos{Anft) ** WD{t, f ) , (43) 

which we could have deduced directly from ( 39 ) and the fact that the WD is always 
real, as follows: 



IPS{t, f) = Real[c(r, y)] 

1 



(44) 



Using ( 38 ), 



IPS{i. y) = ^ ** WD{1, J) + ** \VD[i, f)] 

= y ** IUZ)(r, J) 

= cos(4V>) ** ^VD{i, f ) . 

At this point, we would like to be able to express the WD in terms of IPS, with 
a relation of the form: 



WD{t, f) = 0[/FS(r, ff] , 



(46) 



where 0[ . ] is an arbitrarx' operator. Unfortunately, the inverse filter does not exist, 
since 






COS(TtDT) 



(47) 



is not defined over the required range of a and t . 

4. Shoi1-Tinie Fourier Transform 

Though the Short-Time Fourier Transform (STFT) can not be considered a true 
Time- Frequency Distribution [Ref 14], its widespread use fully justifies the study of how 
it relates to IPS. This hs been addressed in [Ref II], and will be done here, after which 
some indirect results will emerge. Let us thus consider the STFT as defined in ( 48 ): 



STFT[i,io)= |F(/,co)|^ 



oo 

r 



2 



s{t + t)iv 



%} 

—CO 



CO CO 

c c 

s(r + t) 5 (r -I- v)w {T)w{o)e~^^'e^^'^^drdu 
J 

— CO —CO 



( 48 ) 
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Substituting s(/ + t) and vv'(t) by their Fourier definitions, vve have that 



STFT{t, cu) = 



S {[ + v)w{v) 






IV (p)e ^^‘dpe 



— oo — oo 



J 

— oo 



s {t + o)vv(u)S(p + co)lV [p)e^^^'^^^'e^“^dvdp 



( 49 ) 



— OO — OO 
oo OO 

r n 



s (r + o)S(p + {p)e ^^'^dvdp , 



— OO — oo 



We thus see that the STFT is a time-frequency smoothed version of the Rihaczek dis- 
tribution of the signal. The smoothing function is the Rihaczek distribution of the 
window used to compute the STFT. Since the right-hand side of { 49 ) is always real and 
positive, and the previous relations are valid for arbitrary' signals and windows, we con- 
clude that windowing a Rihaczek distribution with a Rihaczek distribution guarantees a 
real and everywhere positive distribution. This result will be important when addressing 
the issue of positivity. 

C. BASIC PROPERTIES OF IPS 

As was already mentioned, and is easily seen from ( 20 ), it is the choice of a par- 
ticular 0(i). t) that determines the particular distribution and, hence, its properties. It is 
thus desirable to establish direct connections between the properties of 0(u. t) and the 
properties of the resulting distribution. This issue has been addressed [Ref 14], and we 
will in the most part only state results, as applicable to IPS. The basic properties of IPS 
are; 

1 . Time shift 

If <y(r) = s{t — to), then 

= (50) 

2. Modulation 

If cu(/) = 5 (/)c'^”A)', then 

lPSJt,J) = lPSfi,f-fo). (51) 
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3. Marginal in Time 



4. Marginal in Frequency 



OO 



IPS{t,J)df=\s{t) \\ 



~oo 



OO 



IPS{t,j)dt = \SiJ) p. 



~oo 



where S(f) = f[5(/)] . 

5. Realness 

IPS{i, f) is real for all t, f. 

6. Instantaneous Frequency^ 

CC- 

f-IPS{t, J)df 

2 ~ fM {instantaneous frcquencv) 

I s{i) I 



7. Groap Delay 



OO 

flPS{t, J)dt 



= Tg{J) iSf'oup delay) 




8. Zero Power 



5(/o) = 0 7/>5(ro,y) = 0 

S(To) = 0 7PS(r,/o) = 0 



(52) 



(53) 



(54) 



(55) 



(56) 



(57) 



The Wigner-Ville Distribution also respects properties 1-7 [Ref. 6), but only a 
weaker version of property 8. 

D. FURTHER PROPERTIES 

Though the properties presented in Section 11-C constitute the basic set of rules to 
which IPS is bound, some further analysis must be made. The effects of linear operators 
should be investigated, to fully understand the behavior of IPS. 

' This property is valid for anahlic signals only ( see Appendix 0 ). 
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1. Windowing in the time-domain 

The issue here is to determine how IPS is afi'ecied by windowing the original 
time signal. 

We will at this point introduce the notation: 

y) = ^ J) + \ IPS^it, f) (58) 



where 



oo 

IPSJii, f) = d{t)d\t - (59) 

— OO 



and 



OO 

r* 

IPS^{t,J)= /(r)J(r + T)e-^"-^Vr. (60) 

% 

— OO 



If the signal is windowed, that is: 

^/) = s(r)u-(r), (61) 

then the IPS of the windowed signal will be 



OO 

r 



IPSJ,lJ) - ^ 



ld{t)d\i - r) + d\t)d{i - 



1 

0 



OO 

r 

[s(!)ni!)s\/ - t)w*(/ - t) + 

— OO 



-b 5 (/)w (r)s(r -b r)w(r -b r)]e . 



(62) 
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Separating the two terms, 



IPSjir, J) = j- 



[5(r)s (/ - T)][vv(r)vv*(/ - t)]e 






[s {t)s{i + t)][w (r)iv(r + r)]e 



— oo 
oo 



F \lPS;^F \lPS~:\e~^''^"dx 



%/ 

— OO 









F \lPS^2F \lPS^.^e~-'^''^"dx 



—OO 



1PS;{1, v)IPS-{r,f- o)do + -f 



(63) 



IPS;{1. o)IPSt{i,f- v)do , 



%J 

— OO 



where F [.] is the inverse Fourier Transform operator. Hence 

y) = ^ [irs;ii. f) • ips~[i, y)] + y y) * ipstu- y)] . (64) 



where * denotes convolution in the f variable. 

/ 

That is, while for the Wigner-Ville. windowing the time signal implies the con- 
volution along the frequency a.xis of the signal's WD with the window's \VD [Ref 6]. 
for IPS we have a sum of two convolutions: the first term of the signal's IPS {IPS;) is 
convolved with the first term of the window's IPS {IPS~), and a similar convolution is 
applied to {IPS;) and {IPS;) . 

A special case occurs when the window, the signal, or both, possess conjugate 
symmetry about i, the time at which we are computing IPS ( i.e., 

s{! + t) = s‘{[ — t), iv(; + t) = iv'(r — t), or both ). If iv(r) possesses that symmetry, then 

IPS;.{1, J) = IPStii, J) = IPS^ii, j) (65) 
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and. by linearity of the convolution operator, ( 64 ) becomes 



OO 



IPSJ.1, J) = 



0 )IPS^{ 1 , f- o)dv . 



— OO 



( 66 ) 



We thus see that in this case IPS and WD respond similarly to the windowing 
of the data; the IPS of the windowed signal is the convolution (in frequency) of the IPS 
of the signal with the IPS of the window. 

In summar}’, the IPS of a windowed signal is a frequency-smoothed version of 
the IPS of the unwindowed signal. The time-resolution is not affected. 

2. Convolution in the Time-domain 

The question now is: how is the IPS of a signal affected if the signal is pre- 
processed with a filtering operation? So, let 

d{i) = s{i) * h{i ) . (67) 



Then. 



IPS,iL j) = Real [[s‘(/) * /!*(/)] T[5(/) * /i(t)] 

= Real [[s*(/) * 

= Real [[5‘(/)5 (/)c^''"-^'] * [/i‘(t)//(/)c^'"-^']] , 



where we used the fact that 



* /i(r)] = 



a{T)b{l — T)dz 



— OC 






(69) 



%} 

“OO 
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We thus see that (contrary to uiiat happens in the WD’^ ) filtering the signal 
does not. in general, correspond to the convolution in time of the signal's IPS with the 
IPS of the impulse response of the filter. What is observable from ( 68 ) is that the IPS 
of the filtered signal is now the real part of the convolution, along the time axis, of the 
two Rihaezek Distributions. The net cfiTcct produced on IPS is, thus, still a smoothing 
operation along the time axis. The frequency resolution is not affected. 

3. IPS and Moyal's Formula 

As is often stated in the literature, IPS does not, in general, respect .Moyal's 
formula (Ref 16]. When applied to the WD, Moyal's formula states (Ref 17|: 



OO OO 



WD^{i. y)M ’/)„(/, fjduif = < s,g > < s.g > 



J J 

—OO — OO 



(70) 



where < s,g > is the inner product of s and g . Let us now see what happens in the IPS 
case; 



oc c>o 

r r 



IPSJi. f)lPS„{i, f)duif= 



J 

— OC’ — 



OC' oc 



J J 

— OO — 'jO 



ls(l)s'(l — t) + 5 (r)s(/ + t)]<> 



(71) 



(;g(/)g*(/ - o) + g*(/)g(/ + n)]c 



didf. 



J 

— OO 



‘ The W’D of a filtered signal is the convolution along the time-axis of the original signal's 
WD with the WD of the impulse response of the filter [Ref 6). 
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oo oo 

r ^ 



IPS,{i, f)IPS.{i, f)didf= 



— oo — oo 



oo oo oo 

r r r 



4 



[s(/)5*(r - t) + 5 (r)5(r + T)][^(/)g (/ - y) + g + o)] 



— OO — OO — OO 



^-J2r.j\T+v)^j. 



drdtdv 



(72) 



OO OO 

r r 



ls{t)s\t - t) + s\i)s{i + T)][g(r)^*(r + t) + - t)] di dx 



— OO —OO 



= — <s,g> <s.g> +— <s,g> <s.g> 



+ — Real 





oo 

r 




oo 










5 *(t- t)g*(/ + t)u't 


dt 




-oo 


%J 


-OO 





and. finally 



OO OO 



IPS,(L f)IPS.(i. f)didf= 



%J %J 

— oo — oo 



1 

2 



<s,g> <s,g> 



+ Real 





oo 

r 




OO 

f* 






smr) 




's*(r- T)g*(r + r)dx 


dr 




-OO 




■OO 





(73) 



The result confirms that Moyal's formula is not, in general, valid for IPS. Only 
in the trivial case of one of the functions being a real constant would the second term 
in the right hand-side of ( 73 ) reduce to <s,g> <s,g> ‘, and hence would IPS obey 
.Moyal's formula. 
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4. Recovery of time-signals 

To address the problem of recovering a signal from its IPS, we will find it con- 
venient to consider two separate cases. 

a. Infinite duration signals 

For real signals with S(0) ^ 0, recovery of the signal is always possible ex- 
cept for a sign indeterminacy, by considering 

IPS{i, 0) = Real[5(r)5(0)] 

(74) 

= S(0)s(0 

and, since | S(0) | can be found by using Property 4 ( 53 ), only the sign of S(0) remains 
undetermined. No similar results are available for complex signals of infinite duration. 
However, if s(r) possesses conjugate symmetry around some t^. then we can also recover 
complex signals of infinite duration up to a phase term, by considering F"' [ IPS{io.f} ]. 
Similarly, if S{f) possesses conjugate symmetry around some f and S(0) # 0, we can 
again recover complex signals of infinite duration up to a phase term, by considering 
FilPS{t,f)l 

b. Finite duration signals 

For finite duration signals, recovery of real or complex signals is always 
possible except for a constant phase term (or a sign, in the case of real signals). 

Let us denote the starting time by t = 0. From ( 12 ) 

F~\lPS(0. m=-^i s{0)s\ - T) + 5*(0)s(t) ] (75) 

and, since s( — t) = 0 for t > 0 . 

F~'[7?S(0, y)] |,>o = Y 5*(0)5 (t) (76) 

which, by property 3 ( 52 ), leaves an unknown phase term. 

E. IPS - A GENERALIZATION OF THE WIENER-KHINCHIN THEOREM 

By redefining auto-correlation, IPS offers a very straightforward extension of the 
Wiener-Khinchin theorem to the non-stationary case, as we shall see. 

Let us define the auto-correlation function of a signal as 

R{t, t) = -^ £'{s(/)s*(t — t) + 5 (/)s(/ -I- t)| . (77) 
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We have, then, using { 12 ) and by linearity of the convolution operator the follow- 
ing pair of relations: 



oc 






—oo 



R{l, t) 






R{i, t) = 



%/ 

—oo 



E{lPS{u J)] 



These relations can be read as a generalization of the Wiener- Khinchin theorem to 
non-stationary signals, since for stationarv^ signals they promptly reduce to that theorem. 
If s{t) is a stationary' signal, then 



R[t, t) = Y - t) + 5*(r)5(r -f t)} 

= j- £{5(/)/(r - t)} + ^ + t)} 

= ^(t) 



E{IPS{t.J)] 



OO 

R{T)e-^^^^''dr = I S(/) P 

J 

“OO 



(79) 



The expected value of IPS is, thus, for stationary signals, the usual Power Spectral 
Density { PSD ). 

Since in the non-stationary case we cannot try to infer ensemble averages from time 
averages, the IPS of a signal is one realization of the generalized Wiener-Khinchin re- 
lations of the underlying stochastic process. 

A different view of IPS when applied to stochastic processes can be found in Grace 
[Ref 1 1], where E{IPS{i, J)] is shown to be the Fourier Transform of Loeve's General- 
ized Power Spectral Density. 
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III. RETHINKING IPS 



A. ON THE USE OF WINDOWS 

IPS, as given in ( 12 ), sufTers from intense ringing at all frequencies “touched" by 
the signal. This same elfect is one of the reasons why Rihaezek's Distribution did not 
find much acceptance. An e.xamplc can be seen in Figure 3, where a linear chirp was 
used as the test signal. 




This ringing elTect can easily be understood by slightly rearranging { 12 ) to ; 



oo oo 



;ps(/,y) = ^ 


5(0 


r 

s {i — + s (0 


r n 

s{( + 




- 


-OO - 


-oo 



(SO) 
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With the appropriate changes of variable, we get 



s{t) 









*— V 

— oo 



= y [^(0 [^(0 * + s\i) [S(0 * 

= Real [ 5*(0 [5(0 * ] . 



( 81 ) 



But 5(0 * is the output of a filter whose impulse response is a complex exponential. 
IPS is, thus, for each frequency, the real part of the product of the signal's complex 
conjugate and the output of an infinitely narrow non-causal filter (non-causal oscillator) 
(Figured). (Ref. 12] 




Figured. Model of IPS 



The ringing is hence unavoidable, and will persist until the filter's impulse response be- 
comes negligible. We should consider stable filters, with decaying impulse responses, if 
we want to diminish this effect. This reasoning leaves us very close to the work of Fano 
(Ref 18], which was later extended by Schroeder and Atal (Ref 19]. 
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If we replace the oscillators in the IPS definition by stable filters with impulse re- 
sponses of the form 

h{i) = , (82) 



where w(/) is any real and symmetric function of t with general lowpass characteristics 
and vv(0) = 1, then the resulting distribution £>(/, J) is given by 



/),(/, y) = Real [5*(t)W/) * 

= y [^( 0[^(0 * + 5 *(/)[ 5 (/) * 



(83) 



s(/)s ( t ) vv (/ — t )<? ''^dz -\- 



s (/)5(u)vv(/ — ''^dv. 



Substituting /j for (/ — t) in the first integral, and — for (/ — a) in the second integral, 
we get 



y) = T 



[s(/)s*(/ - /,) + s\i)s{i + /,)] u.-(/,) c 



J 

— OO 



(84) 



— J) 



where y{z) = u'(t —i). 

That is, the resulting distribution is the IPS of the windowed signal, where the win- 
dow is the envelope of the filter impulse response, centered at the time of analysis. Since 
u(/ — t) is real and synunetric around /, we see from ( 66 ) that a smoothing of the IPS 
in the frequency direction will result. 

IPS in general can hence be written as 



IPS{,,f) = ^ 



[s(/)s (/ — t) -I- 5 (/)s(/ -I- t)] vv(0)vr(r)c ^^'‘-^'dz 



(85) 



— OO 



where w(z) is the chosen window. 
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To illustrate the efTectiveness of the windowing operation, we computed the IPS of 
the test signal used in Figure 3. A Hamming window was used in ( 85 ), and the results 
are shown in Figure 5. As is observable, the ringing is avoided. 



Figure 5. IPS for a linear chirp - Hamming window 

B. PROPERTIES AFTER WINDOWING 

Some of the properties of IPS will be affected by the windowing operation. It will 
thus be necessary to determine which properties are modified and establish the con- 
nections between the characteristics of the used window, and the effects it generates. 

Tracing back the windowing operation to the kernel function (0(i), t)) in ( 20 ) that 
it implies, we will be able to use well established results [Ref 14] about these functions. 
The resulting kernel function will be 




O(o, t) = w(0)u'’(t) cos(7n;T) , 



(86) 



which can be proved using ( 19 ) as follows : 



oo oo oo 



r r r 



IPS{1, J) = 



^v(0)u (t) cos(7tuT)s(r, .(87) 



%/%/%/ 

— OO — OO — oo 
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Replacing the cosine by complex exponentials, vve have 



IPSii. J) 





r r 

vv(0)\v(r) 




2 






— oo — oo 

oo oo 



( 88 ) 



+ 





r r 

u-(0)w(t) 


J 


2 

J 



+ y )^‘(h - y ^ 



— OO — oo 



and, by convenient change of variables. 



IPS{i, J) = 



OO oo 






1 



J J 

— oo —oo 



oo 

s(^)s {fx — r)e^^“''^dixe~^^"^'''^~Pdudx 

—OO 



+ 



r * 


c r 

u-(0)u-(t) 




2 



s iiJ.)s{iJ. + T)e^'^^^^d{xe dudr 



— oo — oo 



(89) 



Interchanging the order of integration, we obtain 



/ps(t.y) = ^ 



oo 



[s(/)5 (/ — t) + 5 (/)5(r + t)]w(0)u-(t)c ^^~^dx . 



— oo 



completing the proof. 

We can now easily determine which properties are maintained after the windowing 
operation, by direct application of the results in [Ref. 14), mapping the characteristics 
of the kernel function to the properties of the distribution. The conditions on O(o. x) 
necessary for the preservation of each of the properties will also be given. 
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1. Time shift 

If co(/) = s{i — /o)» then 

IPS Jr, J) = IPS,{i - r„J) . 

The requirement on O(o, t) is: 

t) constant for all t . 

This property will thus be maintained for every used window. 

2. Modulation 

If <y(/) = then 

IPSJ^i, J) = 7P5,(/,/-/o) . 

The requirement on 0(i;, t) is: 

T>(t;, t) constant for all /. 

This property will thus be maintained for everj’ used window. 



3. Marginal in Time 



OO 



lPS{i,J)df=\s{t) \\ 



V 

— OO 



The requirement on 0(i.>, t) is: 

0(i;, 0) = 1 for all v . 

Maintained iff vr(0) = 1 . 



4. Marginal in Frequency 



OO 



IPS(t,J)dt = \S(/) I'. 



%7 

—OO 



The requirement on cl>(c, t) is: 

0(0, t) = 1 for all T . 
Not maintained for practical windows. 



(91) 

(92) 



(93) 

(94) 



(95) 



(96) 



(97) 



(9S) 
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5. Rea hies s 



IPS{t, f) is real for all /, /. (99) 

The requirement on (I)(u, t) is; 

cI>(o, t) = 0*( - u, - t) . (100) 

Maintained if the window is real and even, or at least possesses conjugate symmetr}' 
around the origin. 



6. Instantaneous Frequency 



f-IPSil, J)df 



I ^(0 1‘ 

The requirement on 0(u, t) is: 



= fi{l) {instantaneous frequency) 



0(u, 0) = 1 and t;;— <I>(o, t) |_q = 0 for all o. 

C T 



Maintained if 



• u-(0) = 1 . 

• w'(t) |..o = 0 . 



(101) 



( 102 ) 



7. Group Delay 

OO 

r 

f!PS(t, felt 

~ ~ l ^(/)l^ ^ (S>'oap delay) (103) 

The requirement on O(o, t) is: 

0(0, t)= 1 and 0(u, t) Uq = 0 for all t. (104) 

Not maintained for practical windows. 

S. Zero Power 

5(/o) = 0 => IPS{t^,f) = 0 (105) 

This property is clearly maintained, due to the multiplicative nature of the win- 
dowing operation. 

S(^) = 0 => 7PS(t,/o) = 0 (106) 

This property will not be maintained since, from ( 64 ), a convolution along the 
frequency a,\is will be performed. 
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C. SPECTRAL RESOLUTION 



1. Tinie-iiivariant Components 

Though IPS is a tool directed towards the analysis of signals with time- varying 
spectral contents, some insight into its inner workings will be gained by considering its 
resolution capabilities for time-invariant spectral components. A comparison of resol- 
ution capabilities for IPS and WD will be made, and contrasted with the standard 
Fourier Transform. 

As seen in ( 6 ), IPS is a coherent average of two terms, one of which uses only 
past data, while the other uses only future information. When analyzing finite duration 
signals, the "past term" will have its maximum resolution capability at the end of the 
data segment, since it is at this point that the most past data is available. Similarly, the 
"future term" will resolve better at the beginning of the data segment. This can be seen 
in Figure 6 and Figure 7, for the past and future terms, respectively. 




Figure 6. Sinusoidal component. Past term. 3-D (a) and Contour (b) plots 



If the two terms are averaged as in ( 6 ), the resulting expression (IPS) will keep the good 
resolution at end-points that each component provides, degrading its resolution 

^ A note of caution should be made here. These "past" and "future" terms should not be 
identified with the two terms in ( 12 ), since each one of the terms in tliis formula spans all the data, 
from — CO to CO. 
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capabilities towards tlic center of the time segment. This efl’cct can be observed in the 
contour plot of rigurc 8. 




Figure 7. Sinusoidal component. Future term. 3-D (a) and Contour (b) plots 




Figure 8. IPS for a sinusoidal component. 3-D (a) and Contour (b) plots 
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This is one of the characteristics of the unwindowed IPS. It has better resolution at 
end-points than it does in the middle of the data segment. We should point out that the 
WD does the opposite, presenting a thinner main-lobe at the center of the time interval, 
and loosing resolution capabilities towards the end-points (Figure 9). 




Figure 9. WD for a sinusoidal component. 3-D (a) and Contour (b) plots 



A more formal discussion can be made as follows: Let us denote by Cir,^(T) the 
function of t whose Fourier Transform is lFD(/o, f) and, similarly, denote by the 

function of t whose Fourier Transform is IPS{l^, J). Hence, 



CllV^) = /(/o-yM^o + y) 

" ~2 Vo - ■^) + -s*(^oW^o + "f)] 



(107) 



where s{i) is the stationar}’ signal of interest. That is, both IPS and WD create, from the 
data, a new "signal", whose Fourier Transform is the wanted spectrum (Figure 10). It 
is thus the effective duration of this new signal that determines the resolution obtainable 
in the frequency domain. 
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s(i) 




CI^ Cr) 




FT<’} 

Spectrum 1 
at t^ 




rT<-} 



Spectrum 2 
at tg 



Figure 10. IPS and WD : conceptual diagram 



Let us now assume that s(t) is only known within a finite segment of length L, 
and compare the theoretical resolution of IPS and WD at both the end-points and the 
middle of the segment. 

a. End-points 

Since both end-points are treated similarly, we will only consider the start- 
ing point of the known signal, which we will denote by Let us consider that a 
rectangular window of length L was applied to the signal ( Figure 11). It is easily seen 
from ( 107 ) that 



CW, = 

'o 



0 T 7^ 0 

U(/o)l' t = 0 



(lOS) 
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and thus, WD,JJ) (Fourier transform of a delta function) is a constant. On the other 
hand, the windowing of the data implies that a window ( w,(/) ) of length 2L is applied 
to Cl,Jir) (Figure 12). 





Figure 12. Window applied to 
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When we now take tlic Fourier transform of C/,^(t) , we will be taking the transform of 
a signal twice as long as the original data. This creates a main-lobe twice as narrow as 
the main-lobe that would be obtained if we were to Fourier transform the data. How- 
ever, we know from ( 107 ) that CI,J(t)= C/,^( — t). This then implies that, from the 
obtained segment of length 2L, only half contains information. As a result, at end- 
points, IPS presents a main-lobe twice as narrow as the one obtainable by Fourier 
transforming the data, but without an equivalent improvement in clTective resolution. 
b. bliddle-point 

We now consider that the window is symmetrically placed around /q, as in 

Figure 13. 




Again, it is readily seen from ( 107 ) that windowing the data with a window of length 
L centered at implies the windowing of CIF,g(r) with a window of length 2L. The elfect 
on C/,^(r), however, is to window it with a window of length L. I his discrepancy implies 
that, in the middle of the interval, the WD has twice the effective resolution of IPS. Let 
us note that, as discussed before, for both C1 F,^(t) and only half of the length 

carries information. Hence, their Fourier-transform will have its main-lobe artificially 
narrowed by a factor of two, without improvement in effective resolution. In sununary, 
the WD has at the middle of the time interval the same effective resolution that IPS has 
at end-points, and hence the same effective resolution obtainable by Fourier- 
transforming the data. 
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This fact of IPS having twice as poor resolution in the middle of the time 
interval than at end-points will be important, since the windowed implementation 
normally uses the window centered at the time of interest. This then means that these 
implementations are using, at each time, the worst estimate (in terms of resolution) that 
IPS will provide. The model of a typical windowed implementation of IPS can be seen 
in Figure 14. 




Figure 14. Typical iniplenientatioii of IPS 

The overall result is that, for these implementations, IPS has, by a factor of two, lower 
effective resolution for stationary components when compared with the WD. 

2. Time-varying Components 

One of the most noticeable characteristics of IPS is the fact that the width of its 
main-lobe depends on the dynamics of the signal. A typical example can be seen in 
Figure 15, where a quadratically chirped FM signal is used. 
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Contour plot 
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Figure 15, IPS for FM signal - Quadratic chirp. 3-D (a) and Contour (b) plots 



This test signal has its instantaneous frequency increasing quaclratically with time. '1 he 
width of the main-lobe is directly proportional to the slew rate. In Figure 16, a cross 
section along the frequency axis of IPS for the same test signal is presented. The past 
and future terms are also indicated. 
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As is seen, the width of the main-lobe of IPS results from the imperfect alignment of the 
two terms; the past term seems to lag the true position of the instantaneous frequency, 
showing a tendency to be positioned in locations previously occupied by the signal, while 
the future term leads the true location of the instantaneous frequency. This lag and lead 
effect become more severe for faster dynamics, hence broadening the main-lobe of IPS. 

In an attempt to explain this feature, one might be tempted to argue that IPS 
lacks symmetry in its definition. Despite being clearly insufficient, as we will see, it will 
be instructive to pursue this type of reasoning a little further, since some additional re- 
sults will emerge. When computing we are, for each t (without loss of generality 

assume t > 0 ), coherently averaging two terms. One results from the present and past 
signal history (jr*(r)ar(/ -f t) ), and the other results from present and future signal history 
(.v(r).v‘(/ + t) ) . To obtain IPS, we will extract the frequency contents of the sequence so 
obtained. However, if the signal has frequency dynamics, each one of the terms will 
contribute differently to the frequency contents of . For example, if the instanta- 

neous frequency of the signal is increasing in time, the contribution of the present-future 
term (.Y‘(f).v(r + t) ) will have higher frequencies than the present-past term, since it is 
centered in a region with higher average instantaneous frequency. The temptation thus 
arises to compensate for the difference in the centers of the two regions. This effectively 
corresponds to forcing the two terms in Figure 16 to align "correctly", thus eliminating 
the broadening of the main-lobe. To do so, it suffices to introduce compensating phase 
terms in the process of computing IPS (see . Alternate way of computing the Wigner-Ville 
below). When these compensating terms are introduced, the distribution that results is 
the Wigner-Ville distribution. We can now interpret WD as: the WD is the distribution 
that results by aligning the two terms of IPS. 

• Alternate uaj' of computing the Wigner-Ville 



From the previous discussion, an alternate way of computing the WD can 
be derived, which allows us to directly apply the WD to real signals sampled at the 
conventional Nyquist rate, thus avoiding the need for oversampling that would be 
required if the WD was to be computed in the usual way [Refs. 7, 10]. The two 
step procedure is: 

1. Compute 



OO 



Fir, u) = j- 



%) 

— OO 



[5(/)s (r — t)c (t)5(/^ + dt 



(109) 
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2. Compute 



oo oo 






^yD{t, J) = 



F(t, v)e 



( 110 ) 



J J 



— oc — oo 



As can easily be recognized, the need for a 2-D Fourier transform places the 
cost of this procedure well above the one needed if an interpolation scheme is used. This 
consideration may disappear in emerging fields such as optical signal processing. 

All our previous discussion has been based on a loosely defined lack of symme- 
trv', intimately related with the concepts of past and future. As pointed out earlier, this 
explanation is clearly insufficient. To illustrate why, a difierent definition of "Instanta- 
neous Power Spectrum" is made in Appendix E, thus creating a new distribution. The 
important point to be made is: not only are the concepts of past and future inexistent 
in this new definition, but it also possesses perfect symmetry. However, the resulting 
distribution is ver\’ similar to IPS, with the characteristic data-dependent width. Since 
lack of symmetry cannot be argued for this definition, the explanation for the broaden- 
ing of the main-lobe must be found somewhere else. 

In Appendix C, the issue of uncertainty is addressed. One important result to 
extract from there is; the maximum obtainable frequency resolution when analyzing 
signals with unknown frequency dynamics is Jdfjdi . This result is in agreement with the 
fact, proven by Rihaczek [ Ref 4], that signals with strong phase modulation have, at 
each time, their energy concentrated within a frequency band of size 



where <!>(/) is the signal's phase. It thus explains the broadening of the main-lobe of IPS 
in terms of the expression of uncertainty. Since the uncertainty region grows for in- 
creasingly faster dynamics, so does the width of the main-lobe. Put in simpler terms, "the 
faster it moves, the harder it is to locate". According to the results of Appendix C, any 
algorithm should behave in this manner, unless it has, or assumes (directly or indirectly), 
information concerning the dynamics of the signal. However, not assuming is the key 
rule for a robust algorithm, one whose performance does not depend on the class of 
signals being analyzed. 




(Ill) 
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D. THE CHOICE OF A WINDOW 

We know from ( 66 ) that, if we pre-window the signal with a window real and 
symmetric around the time at which we are evaluating IPS, the resulting IPS will be: 



J) = 



lPS,{i,v)-IPS^{i,f- v)dv 



— oo 



( 112 ) 



But 



OO 






/PS,(r.y) = -^ 



[vv(r);v(r — t) -f- + t)]c 



and, since u-(/ — t) = vr(/ -t- t), 



(113) 



OO 



IPS.il, f) = 



n'(r)H'(r -r t)c 



= w{t)\V(f)e'^^-P 



(114) 



where w^(t) = w(r -f- t). that is. vv„ is the used window shifted to the origin. 

Hence, if a real and even sliding window is used, the smoothing function in ( 112 ) 
is the Fourier Transform of the window vv„(t) . It is interesting to compare this result 
with the Pseudo- Wigner Ville (frequency smoothed Wigner-Ville), where the smoothing 
function is not the Fourier Transform of the window itself, but the Fourier Transform 
of the square of the window (in a rescaled frequency axis) [Ref. 6]. We thus see that all 
the knowledge on window functions for the Fourier Transform can be directly applied 
to the choice of a window to be used with IPS. 

Though the smoothing function in ( 1 12 ) is always real (we are considering only real 
and even windows), it may be negative. This can contribute to the presence of strongly 
negative values in the resulting IPS. especially if the side-lobe structure of the window 
is not controlled. This thus suggests that windows with good side-lobe suppression 
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should be preferred. Spectral resolution is another important issue to be considered 
when choosing a window. 

As discussed in the previous section (Section 111-C), IPS has a main lobe whose 
width is proportional to ^dfjcii , in accordance with the uncertainty principles of Ap- 
pendi.x C. Hence, degrading the apparent frequency resolution of the unwindowed IPS 
does not become a relevant issue, if working with windows whose effective duration is 
well above the reciprocal of the uncertainty region. The width of this region should thus 
be the criteria for the choice of window size, but would require a-priori knowledge of the 
dynamics of the signal. In the absence of this knowledge, optimality will not be achieved. 
However, as we will see. IPS is a very forgiving tool, in the sense that its performance 
is not seriously affected even for large deviations from the optimal window size. 
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IV. EXPERIMENTAL RESULTS 



To illustrate the types of behavior predicted in the previous sections, some test cases 
will be presented. For all the cases the test signals are 128 point sequences, with one 
spectrum for each sample. All plots are on a linear scale. Some considerations relating 
to the Pseudo Wicncr-Ville (PWD) were made; all test signals are analytic, thus freeing 
the PWD from the somewhat annoying interferences between positive and negative fre- 
quencies. The exception to this rule will be Figure 19, since its sole purpose is to 
illustrate the effects of the use of real signals. Also, when noise is present (Figure 24), 
analytic noise is used. Unless otherwise stated, all Figures use a 41 -point Hamming 
window, and present the distributions after smoothing along the time direction. Hence, 
most of the spectral cross-terms of the PWD due to multi-component signals do not 
appear in the plots. To illustrate the effectiveness of time-smoothing. Figure 17 was 
provided, diflering from Figure 21 only in the fact that the time-smoothing technique 
was not used. 




Figure 17. FSK. IPS (a) and PWD (b). No time-smoothing. 



A. REAL SIGNALS 

In Figure 18, IPS and PWD for a linear analytic FM chirp are presented. 
Comparing Figure 18 with Figure 19, where the test signal is a linearly chirped cosine 



44 




function, \vc can appreciate liow insensitive IPS is to the fact of the signal being real. 
Ihe artifacts present in the PWD when analyzing real signals arc, for all practical 
purposes, absent in IPS. 




Figure 18. Analytic linear FM chirp. IPS (a) and PWD (b). 




Figure 19. Linearly chirped cosine. IPS (a) and PWD (h). 
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B. INSTANTANEOUS POWER 



Another visible elEcct in Figure 19 is the "amplitude modulation" of the main-lobe 
of IPS. IPS gives us the distribution in frequency of the instantaneous power of the sig- 
nal. It then follows that, when computing the spectrum at a low power sample, the 
amplitudes of the frequency components will have to be small, since their total power 
must add up to the low power of the sample. This is a direct consequence of ( 95 ) and 
is a requirement that has be fulfilled by any positive distribution. It is not observable in 
Figure 18, since the used analytic signal has the same power at all samples. 

The WD, though also obeying the Marginal in Time property, tends to absorb the 
Iluctuations of the instantaneous power in its cross-terms, and, in order to do so, is 
forced to assume strongly negative values. 

C. END-POINT RESOLUTION 

Though the better resolution at end-points provided by IPS is in general well visible 
in all the plots that are presented. Figure 20 is probably one of the most representatives. 




Figure 20. FSK (127-point Hamming uindow). IPS (a) and PWD (b). 
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Considering it, we see that not only is the end-point resolution of IPS better, but also 
the transition times are shorter, allowing a more accurate definition of the time of spec- 
tral jumps. 

As stated in Section Ill-D, IPS is a forgiving tool, in the sense that we can deviate 
from the optimal length of window without severely degrading its performance, 'fo 
illustrate. Figure 21 is presented, which dilfers from Figure 20 only in that it uses a 
window with dilfcrent effective duration. 




Figure 21. FSK ( 51-poiu( Hamming nindow ). IPS (a) and PWD (h) 



As is seen, the time resolution is basically kept the same, the only noticeable difference 
being the width of the window's main-lobe. 

D. MULTI-COMPONENT SIGNALS 

Performance for multi-component signals is addressed in Figure 22 and Figure 23, 
where the test signal has three components; a linearly chirped F.M signal, a quadratically 
chirped FM signal, and a stationary component. As seen, IPS is free of the cross-terms 
appearing in the PWD. However, for IPS, each one of the spectral components has its 
width affected by the associated uncertainty region and is, hence, wider than the corre- 
sponding one in the PWD. Again, the instantaneous power modulation is visible in IPS. 
These effects arc also apparent in the provided contour plots. 
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Figure 22. IPS for Multi-component signal. 3-D (a) and Contour (b) plots 



i 
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E. PERFORMANCE IN NOISE 

One issue still not addressed is the performance in noise. In Appendix I), a proof is 
given that, lor a signal embedded in additive bandiimited white Gaussian noise, the 
variance of IPS is, on the average,^ smaller than the variance of \VD. Also, from the 
same Appendix, we have that, when the bandwidth of the noise approaches infinity, the 
improvement in the variance given by IPS approaches the limiting value of 3 dI3. 
Figure 24 is provided to illustrate this theoretical result. The SNR for this picture is 
5 dB, and it is seen that IPS does, indeed, present a smoother estimate of the instanta- 
neous spectrum. 




Figure 24. FSK ( SNR = 5 tlB ). II’S (a) and PWD (a) 

F. THE CAPTURE EFFECT 

Having borrowed the term from communication theory (FM demodulation), our 
goal is to compare the behaviors of IPS and PWD when analyzing multi-component 
signals, when the components have different amplitudes. In Figure 25, the two compo- 
nents have amplitudes dificring by a factor a four (12 dB). As is observable, both IPS 
and PWD detect the weaker component, despite the fact of the cross-terms in the PWD 
being already more energetic than the weak signal itself. In Figure 26, the same two 

fhe variance of IPS is time dependent. When we say that "...the variance is. ..on the aver- 
age....", we are referring to the time average of the variance. 
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component signal is presented, now with a factor of 8 between amplitudes (18 dD). It 
is obvious that the PWD lost the weak, component. The cross-terms, however, still re- 
main. In this case, the position of the weaker term could be inferred from the cross- 
terms, but such a technique would be impossible to apply to more complex signals. 




Figure 25. Pouer ratio = 12 dB. IPS (a) and PWD (b) 




Figure 26 . Power ratio = 18 dB. IPS (a) and PWD (b) 
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V. PARAMETRIC IPS 



A. INTRODUCTION 

We have, from ( 12 ), that the IPS of a signal is, at each time, the Fourier transform 
of a function of the signal. This implies that the wanted information lies in the spectral 
content of this function. The process of obtaining the IPS can therefore be reduced to 
the problem of performing conventional spectral estimation on a function G{i, t) which 
is derived from the signal as in ( 11 ). The use of parametric methods should thus be 
considered, in an attempt to improve the resolution capabilities of IPS. These consider- 
ations are also applicable to the WD, and a parametric version of this distribution has 
been considered [Ref 20]. 

B. AR MODELING - CONSIDERATIONS 

Though obtained as a bilinear transformation of the data, the function in ( 1 1 ) can 
not be considered a true autocorrelation function, since it is not constrained in any way 
other than possessing conjugate symmetr\’ around the origin. It can not be guaranteed 
to give rise to a positive definite autocorrelation matrix, and hence can not be used to 
directly solve the normal equations. We can however estimate its autocorrelation func- 
tion and fit the model to this autocorrelation. In a sense, we will be fitting the model 
in the fourth moment of the data. In all the results presented, the .-\R parameters are 
obtained using the .Modified Covariance .Method. This method has been found to pos- 
sess good resolution properties, while alleviating some of the problems associated with 
the .Maximum Entropy .Method (.ME.M) [Ref 21]. All plots are on a logarithmic scale. 

C. EXPERIMENTAL RESULTS 

Choosing the order of the model is a sensitive issue. From Figure 16, we know that 
the main lobe of IPS consists of two terms, which do not align exactly. We nright thus 
expect that, if the correct order is exceeded, the extra poles will tend to resolve these two 
terms, and each spectral component will have two associated peaks in the spectrum. In 
Figure 27, where the test signal is again a single component quadratically chirped F.M 
signal, this elTect is well visible. In the left, a one-pole model was used (analytic test 
signal). With the addition of one extra pole, we can see that the two terms were resolved 

^ Lacking both temporal and statistical averaging. (11) can not be considered to represent the 
second moment of the data. 
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(with some bias), and the undesired elTect is now present. In Figure 28, a fourth order 
model was used to fit a three component signal. 




Figure 27. Paranielric IPS. First (a) and second (b) order models 



We can observe that, since the main lobes were not over-resolved, the signal components 
are well defined, and can be located with greater precision than what would have been 
possible with the non-parametric IPS. 
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Three component signal. Parametric IPS. 


Fourtli-order model. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 



In this thesis, an alternative tool for the spectral analysis of signals with time- 
vary'ing spectral content is presented (IPS). Having its roots in the definition proposed 
by Page of "Instantaneous Power Spectrum", IPS has a defining expression very similar 
to the one defining the Wigner-Ville distribution (WD). Their performance is, however, 
considerably different. The WD approach, though extensively used in the last decades, 
suffers from a number of short-comings. The foldover frequency of its discrete fre- 
quency is located at njl , requiring the use of analytic signals or the use of an interpo- 
lation scheme as an alternative to sampling at twice the Nyquist rate. Interference when 
used with real signals, cross-terms when multi-component signals are present, preference 
of linear dynamics and loss of resolution at the extremes of the analysis segment are 
other problems associated with the WD [Ref 8). The distribution presented here is 
shown to have some advantages over the WD. These are; evenness of performance for 
different dynamics of the signal, direct applicability to real signals sampled at the 
Nyquist rate, reliability when analyzing multi-component signals, and preservation of 
spectral resolution at end-points. Also, IPS is shown to behave better in the presence 
of additive white Gaussian noise (AWGN). As a disadvantage, IPS does not achieve the 
frequency resolution provided by the WD. Due to its robustness, IPS is especially well 
fitted to be used as a front-end tool in non-stationary spectral analysis. The usefulness 
of AR modeling as applied to IPS is investigated. It is found to be a technique very 
sensitive to the chosen order model, but providing some advantage in defining the lo- 
cation of each spectral component. The use of other parametric techniques should be 
investigated. In Figure 29. we can see the FSK signal of Figure 20, this time processed 
with the following expression: 






OO 



— t) — 5 {l)s{l + t)]e . 



— OO 



(115) 



That is, Figure 29 is the imaginary part of the Rihaczek distribution of the test signal. 
As is seen, it shows some natural ability to detect fast transitions in the spectrum, and 
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seems to be a promising area for future work. Also, linear combinations between IPS 
and this imaginary part of the Rihaczek distribution should be investigated. 




Figure 29. Imaginary part of Rihaczek's Distribution - FSK 



Signal synthesis from IPS may also be a promising field of research for speech processing 
applications. 
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APPENDIX A. IPS FOR FINITE DURATION DISCRETE SIGNALS 

In this appendix, a formal derivation of the expression defining IPS for discrete 
signals { 16 ) is presented, as an alternative to the direct discretization of ( 12 ). 

The discrete version of (3) is, assuming that the sequence starts at « = 0: 






S~{n, 0) = Ar2^ 






(116) 



Hence, 



I 0) f 




n=0 




(117) 



where 



Ck = 



5(h)s {n — k) k>0 



n=k 



(US) 



and 



^k Jfe) • 



(119) 



Similarly 



,v-i 



S'^{n,0) = AT) 



n=i 



( 120 ) 
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s(«)5 {n — k) 

n=i-^k 



k>0 



( 122 ) 



and 



Ck 






(123) 



Now. to get the digital equivalents of (2), (4) we must approximate the derivatives of 
( 117) and ( 120 ), which we will do using first order differences; 



|S>.»)P-|S-(/- 1,9)P] 
= Ar^trf-'" 

k=-i 



(124) 



where 

c'^ = s{n)s {n — k) k>0 (125) 

and 



£* = (£_// 



(126) 



In the same wav. 
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(127) 
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where 



= 5 {n)s{n + k) k>0 



(12S) 



and 



4 = (£!/. (129) 

Assuming the signal to be zero outside the known samples, then according to Levin's 
definition (6): 



IPS(n, 6) = \\_p-{n, 0) + p>, 0)] 



^T 



OO 

[ 5(^)5 (n — A') + 5 («) 5 (rt + 

li= — 0O 



which is the wanted result. 



(130) 
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APPENDIX B. INSTANTANEOUS FREQUENCY AND IPS 



The proof of property 6 ( 55 ) can be made as follows: 
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Therefore 



oo 

f IPS{t, J)df 

L Imag[5*(/)5'(/)] 



I s(r) P 271 1 s(0 I' 



( 133 ) 



For real signals, this center of mass is zero, as expected, since the spectrum is conju- 
gate symmetric about DC. 

For analytic signals: 

s{i) = m{i)e^'^^‘^ (134) 



and 
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and. finally 

which is the definition of instantaneous frequency for analytic signals.^ 



This defmilion of Instantaneous Frequency was introduced by V'ille [Ref. 2] in 1952. 
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Hence, we have that, for analytic signals 



f-IPS{t, f)df 
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( 137 ) 



and our proof is complete. 
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APPENDIX C. NON-STATIONARY SIGNALS AND THE PRINCIPLE OF 

UNCERTAINTY 



When analyzing signals with time-varying spectral contents, there is a maximum 
obtainable frequency resolution, determined by the dynamics of the signal. This is in 
contrast with the stationar}' case, where the only constraint on the obtainable resolution 
is the observation time. The presentation and proof of this effect is the goal of this 
chapter. 

A. THE PHYSICAL CONCEPT 

Before presenting any mathematical results, we will try to approach the subject with 
a brief discussion of the involved ideas. 

Let us consider our ability to perform spectral estimation on a stationar}’ signal. 
For that purpose, we will assume that the incoming signal has been segmented into 
contiguous intervals of A/ seconds. Our final goal is to determine the spectrum occupied 
by the signal at roughly the moment of analysis, that is. within the last received segment. 

If we are given only this last segment, the obtainable frequency resolution is con- 
strained by [Ref. 1] 



A/> (13S) 

But if now, for some reason, the next to the last segment of data becomes available, 
we will be able to improve our analysis of the last segment up to a resolution of 

A/i lir 

We can continue the process and, if more segments of data become available, im- 
prove our spectral description of the signal at the present time. The rule in this case is: 

• The stationarity of the signal allows us to improve the analysis of the present time 
by using data collected no matter how distant in the past. Hence, if infinite obser- 
vation time is allowed, infinite spectral resolution can be achieved. 
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Let us now consider the same scenario, but with a non-stationar\' incoming signal, 
v.'hich, we will assume, can be considered stationary’ within the segment. If only the last 
segment is available, our frequency resolution is again constrained by 



and the only way to improve it is to consider more segments. But a qualitative difference 
appears at this point. Segments of the “distant past” are rendered useless by the dy- 
namics of the signal unless we know or assume its behavior. Considering those segments 
will only degrade our spectral analysis. The usable data is thus restricted to the most 
“recent past”, which implies an upper bound on the obtainable resolution. 

• For a non-stationary signal with unknown dynamics, there is a ma.ximum obtaina- 
ble frequency resolution, which depends on how distant the unusable “distant 
past” is. It is an absolute maximum, and depends only on the dynamics of the 
signal. 

Proving this statement is the purpose of this Appendix. 

B. MATHEMATICAL FORMULATION 

Let us briefly discuss Gabor's result. Gabor [Ref 1] defined the ''effective duration" 
A/ and the "effective frequency width" A/ of a signal \p{t) by the following equations 



where [.] stands for the average, and the «''' moments are defined, ommiting the argu- 
ment of as 
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resulting in 



ArA/s 



(143) 



This only states that any signal whose elFective duration is Ai ( see Figure 30 ), will 
have an efTective bandwidth Af ( see Figure 31 ) greater than ; the issue of uncer- 
tainty is not addressed in this result. 
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Figure 30. Effective duration of s(t) 
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Figure 31. Effective bandwidth of s(t) 



In Fact, its relevance as an uncertainty principle only comes into play when we realize 
how classical Spectral Estimation is achieved. 

Classical methods are based on the “measure of similarity” between the data and 
more elementary Functions of time which have strong alFinities with the physical concept 
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of frequency. The best example is the Fourier Transform, which conceptually is a bank 
of correlators, tr\'ing to determine the “measure of similarity” between the data and 
sinusoidal functions, the true representation of our concept of frequency. Other typical 
examples are in the works of Gabor [Ref 1] and Priestley [Ref 22], using elementarx’ 
functions on the t —f plane, where these functions have what Priestley calls an 
“oscillatory form” needed to establish the correspondence to our notion of frequency. 

Now, if the data is of length T, the elementaiy^ functions can have at best the same 
duration, which implies that their frequency width is greater than Since the ob- 
tainable frequency resolution is at best of the size of the frequency width of the ele- 
mentaiy functions, the principle of uncertainty in Spectral Estimation follows. 

Gabor's result is only concerned with stationary signals, or at least assumed to be 
stationary, since A/ is the effective width of the Fourier Transform of the signal. How- 
ever, some implications can also be derived for the non-stationary case. 

Let us assume we have an arbitrary stationary' signal iA(r), which we will consider to 
be analytic to simplify the formulation, with effective duration At, effective bandwidth 
A/ and constant instantaneous frequency. Formally, 



Let us now force the signal to linearly chirp in frequency, by multiplying it with the 
phasor and defining the modified signal by 



where A is a positive real number. Due to the exponential, is now a non-stationary’ 
signal chirping linearly in frequency with a slew rate of 



[ arg[i/'(t)] ] = constant 



(144) 






(145) 



d[ 



(146) 



The following results can be established for 



• Effective time duration 



The efiective time duration will be 




(147) 
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where 



'I' ‘ di 



— oo 



\p \p [ d[ 



oo 

r 









%J 

— oo 



\l/ \pdi 



%) 

— oo 






dl 






if/ t/^ r dt 



= «• 



\p \jjdt 



Hence, 



= 2;: [ (/^) - (7)^] = (A/)^ . 



• Effecihe bandwidth 

The effective bandwidth will be 



AAv/ — \.' ~/m) — 2?r [ (f^f) (4v/) ] • 



where 









1 -o 









— OO 



Using the identity 
dif/ 



M 






( 148 ) 



( 149 ) 



( 150 ) 



( 151 ) 



( 152 ) 



( 153 ) 
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we have that 



•A’! 



,v/ 



dr 



dt 



f.M — 



27t/' 



\j/ if/dt 



oo oo 

,* , 



\j/ ipkidi 



2?l/ 



■ + 



i/' \}/di 



— OO 
oo 



,* d>lf 

* — 



2ry 



+ ;t' 



^ il/dt 



ijy \{/di 



\jj \l/idi 



ip \j/di 



( 154 ) 



=f+kl. 

Hence, all we are ntissing to completely determine A/^, is the expression for {fh) 
where [Ref. 1] 



di di 



di 









( 155 ) 






Noting that 

d^\f dip;:} dip dip* , r d>P , , dip* _ ,^ 22 , ,* 

~ — 3T = + 
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we have 



oo 

r r- 



— oo 



d\b d\b * d\b d4^ ^ o 

— —^-j2nkl\j/ -Vjlnkiii + (27r/c)^/V'A 



di 



(2rr)' 



(j/ \j/di 



j 

—OO 



(157) 



= W+* 



\p rijjdi 



2 — oo 



+ 



- 









\b l — r- dl 
di 



"> 7 T ^ 



i/' ij/di 



2n fT 



\l/ \jjdt 



—OO 



ij/ \l/dl 



•y 

— oo 



Considering now the following identities 
dxjj' il/if/' d 



d 



I’'"' ]+7'W‘-i-i:“rg(«] 



we have 



(15S) 



>A4*^[arg(i/()]yt 



27T7 , A- ^>0 



if si) = (rl + + ~ 



r 



lb \bdi 



— oo 



and, using ( 144 ) we get 



{fsi) = {f) + k\t^) + 2klf,. 



(159) 



(160) 
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Since [Ref. 2] 



we have, by ( 144 ) that 



\l/fi \j/ dt 



/=■ 



1 -oo 



\j/ \j/di 



(161) 



From where, 



\p\l> dt 



f=f^ =fi- 



\p i/di 



— oo 



ifsi) — + 2A:// . 

Hence, from ( 151 ) and ( 154 ). 

A/u = + A-(?) + 2A-F/] - 2;t[aV + k\l)^ + 2AF/] 

= 2n 



if) -(/?] + ] 



= (A/)‘ + A'(A0' . 



(162) 



(163) 



(164) 



Therefore, using ( 150 ) we can write the effective time and frequency widths of the 
modified signal il/^it) in terms of the effective widths of the original stationary signal 
ip{[) as 



{AfJ = {^f)^ + k\Atf. 

2 2 
(At,,)' = (A;)'. 



Using ( 143 ) and ( 165 ), we get the result 
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( 166 ) 



= (A/)' [ (AT)' + /: W ] 

= (A/A/)- + k\^tf 
>^ + k\M)\ 



from where 






1_ 

4(A/)' 



+ k^i^tf . 



(167) 



Hence, if we minimize the right hand-side of ( 167 ), we will find a lower bound on the 
effective bandwidth of the chirped signal. Proceeding with the minimization, we have 



c 

dAt 



1 

- 4(A/)' 



-f k\Aif 



= 0 => 2A^(A/) = 0 

4(A/)' 



=>A'^A/ = 
=-(A/)" = 



1_ 

4(A/)' 

1 



4k^ 



=>A/ 



\ 




Hence 



(A/v/)'> 4-^-4 



A/v/ ^ \ ^ 



and we can read the obtained results as stating: 



(169) 



Lemma 1. A linear chirp will always occupy an effective bandwidth greater than 
^■dfjdt , independently of its effective duration. 
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Lemma 2. A linear chirp will occupy the minimum effective bandwidth of yjdfjdi , 
onlv if its effective duration is 



A/ = 



These results will have obvious implications when classical spectral analysis of the 
chirped signal is attempted; no matter what the duration of the elementary functions is, 
the signal will always appear to occupy more than Jdfjdi , and this will thus be the best 
obtainable resolution. This completes our proof 

Lemma i.The maximum obtainable frequency resolution when analyzing signals 



with unknown frequency dynamics is ^dfjdi . 



Though the previous discussion was restricted to the case of a linear chirp, the 
underlying ideas remain valid for laws of variation other than linear; the treated case can 
be considered as a first order approximation for arbitrary non-stationary signals. 
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APPENDIX D. PERFORMANCE IN NOISE 



As we will see later in this Appendix, an absolute evaluation of the performance of 
IPS in noise is not possible, because the second moment is data dependent. We will thus, 
instead, compare the noise performance of IPS with the noise performance of the 
Wigner-Ville distribution. Let us consider the inverse Fourier transforms of IPS and 
WD. 



t) = y [5(/)5*(/ - t) + s\t)s{( + t)] 

versus (172) 

ar,(/,T) = 5(/+ ^)s*(r-^). 



Given a signal z{t) that is corrupted by WAGN, where the noise is analytic and 
bandlimited to the maximum frequency (II^ signal, that is: 



r(t) = r(0 + At(0, 



(173) 



where 



5„(w) 




0 < w < IV 
otherwise 



(174) 



Ki^) = 



•Vr 



sin( 



IFt 



2;r 



IFt 



/JLl 
e' 2 



(175) 



A. MEAN 
1. WD 



ClVy{[, t) = v(t + ^ )v\t - y ) = [ z(M- y ) +n{l + y )]-[2*(/ “ y ) + «*(^ “ ^ 

= z{t + j- )z\t -j-) + z{t + f )n\i -f) (176) 

+ n{i + -^)n (r — — ) + n{t + — )z {t ) . 
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(177) 



E{C]\\{t, t)} = CIV,(L z) + z(! + f )£{rt*(^ - ^ 



Hence, 



£{C11V(^, t)} = ClF,(/, t) + £„(t). 



2 . IPS 



C/yU, r) = j-[ v(/)v‘(/ - T) + v*(r)v(/ + T) ] 

= y [2(0 +«(0]-[2*(/ - t) + «*(/ - t)] (179) 

+ y [2*(0 +«*(0]-[2(; - t) + f7(r - t)] . 

E{C/,(i, r)}=j- z(i)-\r - z) + j- £{«(0«*(r - t)] 

+ y z (0z(r + t) + -^ £|/7 (i)77(r + t)] (1 80) 

= CE(t, t) + y R„(z) + y R„(z) . 



Hence. 



£{C/,(r,T)} = C/,(7.T) + £„(T). (181) 

Therefore, IPS and WD have, as far as their first moment is concerned, identical be- 
havior in the presence of noise. 

B. VARIANCE 
1. WD 

1 CU'M, z) P = v(r -h f )v*(r - f )v*(r + j- )v(i - f ) 

= [z(t + y ) + m(/ + y )][z (r — y ) + rt (t — y )] 

•[z*(r + f) + «‘(r + f )][z(r + 
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I Cl{\,(!, t) P = [.-(/ + f )/(i -f) + 2 (r + f )n\r - f ) 

•[2 (s + f)z(r-j-) + , (t + j-Xr-j-) 

+ rt (/ + "Y )z(t — Y ) + « ^ — "^ )] • 

Using now the expectation operator, we have; 

£{iar,(/, t)|'} = |2(/ + ^)|'|2(/-^)P 

+ 2(r + ^)2*(/-^ )£{//*(/ + -^)} 

+ 1 -(^ + Y ) I ~ ^ } 

+ z(r+.f ) 2 (/ - f )£{»*(/ + >/(/ -j-)] 

. " f " 1 

+ z {i — -^)z (/ + Y + "T ~ 

+ |2(/--^)p£{|,/(/ + -^)|'} 

+ £{/;(/ + j- )«*(r - ^ >;*(/ + ^ >?(/ - j - )} 

+ z\i + j- )z{t - f )£{/,(/ + f W(i - ^ )} . 

And, since^ 

£{/;*(/ + ^ )n \ )} = £{«(/ + Y - T )} = 0 . (I S5j 

£{ I CU\{t, r) I' } = I 2 (r + ^ ) |2 I - X ) |2 + ,(, + j. ),•(, _ X )/^;(,) 

+ I 2 ;j ^ ~ *2" 

+ |2(/-T2)1 ~y^ h I £„(t) I + ^ 2;r j ’ 

See Appendix F. 
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where we made use of the fact that, for zero mean Gaussian random variables 
[Ref 23], 



• = 0 

• £{jr,jr2.V3JC’4} = £’{A:,X2}-£{j(:3jr4} + £{jf,X3}'£{x2Jf4} + £{x,;r4}'£{jrp^3} . 



Hence, 



£{|Or,(r,T) |2} = |^(^ + ^)z*(r-y)P + ^[U(/ + y)l' + U(t-^)P] 



+ -R«(i^)z*(/ + y )r(/-y) + £«(T)z(/ + y )z*(/-y ) 



Now, 



Var{Cir,(r, t)} = E{ \ CIV,(r, r) p) - E{CJV,(r, t)}£{OF,(/, t)}* 

= £{ I or,(r, t) !'} - [CIV,U, t) + £„(T)]-[ar;(/, t) + £;(t)] 
= £{ I Cir,(/, r) l'} - I z(! + f )z*(r - y ) l' - I £„(r) |' 



From where, using ( 1S7 ) and ( 172 ), 



Var{CIF,(/, t)} = [ | z(r + ^ ) j' + j z(/ - ^ ) j' ] + 



2n 



(188) 



(189) 



2. IPS 



I t) 1^ = y [i-(t)v*(t - t) + v‘(/)v(/ + t)][v‘(/)v(/ - t) + v(/)v‘(/ + t)] 

= y - t)v(/ - t)] + y [v*(r)v(/ + T)v*(/)v(r - t)] ( 190 ) 

+ y CK0v’*(^ - T)r(/)v*(t + t)] + y [v*(/)v(/ + t)v(/)v>*(/ + t)] . 

For simplicity, we will treat each term in ( 190 ) separately, and use the fact that 
the second and third terms are complex conjugates of each other. 
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• First term 



-J [v(/)v*(:)/(/ - t)v(; - t)] | = 

= J £{[2(0 + n(0][2*(0 + rt*(0][2*(/ - t) + n\i - t)][2(/ - t) + n{[ - r)] ) 

= ^ E{iz{t)z\l) + 2(0«*(0 + 2*(0«(0 + rt(0«*(/)] 

•{_z\t - r)z{l -T) + z\t- r)n{[ - t) ( j 9 j ) 

+ z(t - - t) + «’(/ - t)«(/ - t)]| 

= -J [ 2 ( 0 ^ W- (I - - t) + I z(0 p£{ I tJ{i - t) I^) 

+ z{l)z (/ - t)£{/7 (0rt(^ - t)} + 2*(0-^(^ - r)E[n{l)n\i - t)| 

+ I 2(/ - r) |'£{ I nil) + £{,/(0«*(0«(^ - - r)}] . 

Simplifying, 

e[ j [v(0v‘(0v*(^ - t)v(/ - t)] j = ^ 1 z{l)z\t - t) I' 

+ ^ - 2;j [ I ^ 2(02*(^ - t)£*(t) (192) 

1 * 1 / I 

+ f z it)zit - t)£„(t) + ^ j + ^ I £„(t) |- . 

• Second term 

^{^[v*(0v(/ + t)/(0i-(/-t)] j = 

= ^ £{[z‘(0 + //(0][2(/ + t) + nil + t)][ 2*(0 + «*(0][2(r - t) + ,/(/ - t)] } 

= £{[2 (0^(f + t) + 2 ii)nil + t) + rt ii)zil + t) + n\l)nil + t)] 

•[2*(0r(r - t) + z\i)nii - t) + n\i)zii - t) + «‘(/)«(r - t)] 

= - f) + z\l)zil + T)£{rt*(0«(/ - t)} 

+ z\l)zil - t)£{«(/ + t)«‘(0}2*(0z(^ + t)£{«*(0/7(/ - t)} 

+ 2 (7)2(7 — t)E\p ( 7 ) 77(7 + t)| + £[77 ii)n ( 7 ) 77(7 + t)/7(7 — t)|] . 

E ^ ^ [v*(7)v(7 + t)v*(7)v(7 - T)]| = ^ 2*(7)2*(7)2(7 + t)2(7 - t) 

+ + -^2*(7)2(7 - t)£„(t) (194) 

+ ^ 2*(7)2(7 + t)£„*(t) + ^ 2*(7)2(7 - t)/?„(t) + ^ | /?„(t) |' . 
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• Third term Since this term is the complex conjugate of the second term, we 
have: 

-J + t)] j = -j z{t)z{t)z\t + T)z*(r - t) 

+ -j z{t)z*it + + -j z{t)z\t - (195) 

+ -J 2(r)z*(r + T)f?„(T) + -j z{t)z\r - r)Rl{r) + y | R„{t) P . 

• Fourth term 

£{-j[v‘(/)v(t + T)v(/)v*(r + T)] j = 

= y £{[z (r) + n (t)][z(r + t) + n{t + T)][z(r) + n{ty]{z (/ + t) + At (/ + t)] | 

= y £{[z (r)z(r + t) + z {l)n{t + t) + « (t)z(t + t) + « (/)/?(/ + t)] 

•\z{i)z (/ + t) + z(/)rt (/ + t) + n{i)z (r + t) + «(/)«*(/ + t)] 

= ^ [z*(r)z(/ + T)z{i)z\i + t) + z\i)z{i + T)E{n[t)n{t + t)} 

+ I z{t) |'£{ I n[t + t) P} + I z(r + r) \^E[ \ n(t) |'} 

+ z(r)z (t + t)£|a; (:)«(/ + t)] + £{« (t)f7(t)n(r + t)« (t + t)|] . 

y [AOyU + T)v(/)v*(r + t)] j = ^ 1 z\t)z(t + t) P 

+ y [ I HO 1“ + I Hr + r) p] + ^ z*(/)z(r + t)<(t) (197) 

+ z(t)z*(/ + z)RHt) + j- ( ^ I R„(r) p . 

Adding the four terms ( 192 ), { 194 ), ( 195 ) and { 197 ), we thus obtain the 
expression for the second moment of IPS. Now, 

Var{C4,(r, t)} = £{ | C/,(r, r) p} - E{C/,(t, r)}-E{C/,(t, t)}‘ , (198) 

where 

£{C/,(r, T))'£{C/,(r, t)}* = [C7,(/, t) + £„(r)]-[C/;(/, t) + R^] 

= C/,(t, z)C/Ht, z) + C/Hr, t)/?;(t) (199) 

+ C/;(:,t)£,(t) + |£„(t)P. 
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Substituting in ( 198 ), we get that 



Var{C/,(/, t)} = £{ | C/,(/, t) |^} - E{CI,{t, T)}-£{C/,(t, t)}* 

N' j 

= |' + |z(/-r)|' + |z(/)|' + |z(t + T)|'] 

+ -jz{l)z\t - r)R„(r) + -j + r)K{T) 

+ z\t)z{t - t)R„{t) + -J z{{)z\l + t )£„( t ) 

+ 2 ' 2 I 2n 



( 200 ) 



We can now compare the variances of C7,(/, t) and r) . First of all we 

realize that, in both cases, the variance is a function of r and t. If t = 0 , we have: 



AolK , 

Var{C/,(r, r)} ^ | z{i) + 



2k 



( 201 ) 



.Vnir 5 / A’oir 

Var{cn-,(r. r)} =-^ I z(/) + f 



( 202 ) 



That is. 



Var{C/,(/. t)} = Var{CMV(/, t)} (t = 0) 



(203) 



For T 7 ^ 0, we have from ( 200 ) that 



1 

Var{CA(/, T)}<^^^ 



[ I z(r) 1^ + I z(r - t) 1^ + I z(/) + | z(/ + t) ] 



+ I £„(t) I [ I z{t)z\i - t) I + I z\i)z{t + t) I 
+ |z>)z(r-T)| + |z(/)z*(/ + T)|] 



(204) 
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Var{C/,(/, t)} < ^ [ I 2(r) + | z(/ - r) |' + | 2(1) |' + | z(^ + t) |' 

+ I 2(0 I I + "t) I + I z (0 I I 2 (/ - t) I + I 2(0 I I 2 (/ - t) I ( 205 ) 

+ I ^(0 I + ■'■) I ] + 2r: J ' 



The lime dependency of the expression stops us from a direct comparison with 
CirXr, t). However, since the signal is arbitrary, we have no reason to assume any or- 
dering of its magnitude at different points in time. We can thus average over time, and 
get the following result: 



<Var{C/,(/,T)}>,, < I 2(/) -h < | 2(1) | | 2(1 -h t) | 

4- < |z(/)||z(/-T)| >,,+ < |2(/)||2(r-T)| >, 

2 



< | 2 (r)| | 2 (r + T)| >^J + 



Ayr 



Apll- 

— 7t 



< U-WI >av + 



ApH- 

In 



(206) 



Using ( 1S9 ). 



< Var(C/,(r, t)} ><,,<< Var{Cli;(r. t)} > ,, 



and finally, we have 

• T = 0 



Var{C/,(r, t)} = Var{ClJ;(r, t)} 



(207) 



(208) 



• T 0 

< Var(C7,(/, t)} > ,, < < | Var{CIi;(/, t)} | > ,, (209) 

For the case t # 0, 1J'>1, we can make the following approximations, using 
( 189 ) and ( 200 ): 

. , 1 f AplF \2 

<Var(C/,(/,T)}>,,^ 2 V"^J 

f AplF \2 

< Var(Cir,(r, t)} > j (211) 
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That is, when the bandwidth of the noise approaches infinity, the difference in 
noise performance between IPS and WD approaches the limiting value of 3 dB. 
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APPENDIX E. A DIFFERENT DEFINITION OF "INSTANTANEOUS 

POWER SPECTRUM" 

In accordance with the discussion in Section III-C, the following definition of In- 
stantaneous Power Spectrum can be made. 



• Definition - Instantaneous Power Spectrum at the time of a given sample is the 
frequency content of the stationary' spectrum implied by the existence of that 
sample. 



That is: reading the Fourier transform as a change of basis in an N-dimensional 
space, the Instantaneous Power Spectrum will be the difference in coordinates (referred 
to the phasor axis) between the location of the signal in the X-dimensional space and its 
location in the (N-1 )-dimensional space that results from suppressing the dimension 
corresponding to the sample of interest. 

Since in terms of the internal product, the absence of any dimension can be repres- 
ented by a zero component of any of the vectors along that dimension, this definition 
leads to a three-step procedure: 

1. Compute 

|X(co)|', (212) 

where 

oo 

A'(co)= ^ w(«)e--''‘"" . (213) 

n=—oo 



2. Define a new sequence, obtained by zeroing .v(«) at the sample of interest, that is: 

(214) 






« # r 
n = r 



where r is the time at which the Instantaneous Power Spectrum is to be determined, 
and obtain 



I l' . 



(215) 



3. Obtain the Instantaneous Power Spectrum as: 
Instantaneous Power Spectrum — 



I - I A',(«) l' . 



(216) 
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The resulting distribution will be ( using extended sequences, as in Appendix A ) 



A^cu)| =X{o3)X\co) 



oo 

oo 

n=—oo ^ 



( 217 ) 



and, after some manipulations. 



= ( 218 ) 



where 



C]^ = y^.T(w)x {n — k) k>0 (21 9) 

n=k 



and 



^-k — ^k 



( 220 ) 



Correspondingly, 



= ( 221 ) 



where 



bi^ = 'y'x^{n)x^ {n — k) k>0 (222) 

n—k 



and 




( 223 ) 
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Hence, we have that 



QO OO 

I Xioj) l' - I X,{c) l' = ^ - Y, 

ic=— OO jc=— OO 

OO 

= Y “ I ’ 

A:=— OO 



(224) 



where 



Aj^ = [jc(r)jr*(r — A) + x{r + k)x (r)] k>0 (225) 

and 

A_, = a;. (226) 

that is, this definition leads almost exactly (up to a constant, for fixed time) to IPS. 
What is important to realize is that this definition has no concept of past or future, and 
that all points are equally treated, no matter what point they occupy in the sequence. 
This result seems to put an end to our hope of finding assymetries in the IPS concept. 

As a final note, a more formal proof of the relation between this distribution and 
IPS can be made as follows: Define 



Hence, 



Let 



_ f.r(rt) 

xfn) I 

‘'W = {“(») 



r 


(227) 


n = r 


n ^ r 


n — r 


(228) 



;c(«) = xfn) + d{n ) . (229) 



X(cu) = F{x{n)] 

D{<o) = F{d{n)) (230) 

X/co) = F(x/u)} . 
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Then, 



A'(w) = A'^(a;) + D{oi) 

and 

I X{oj,) \ - I .V,(to) l' = [X,{co) + Z)(o;))(Ar;(a;) + Z)*(o;)) - X»A;(a>) 
= X,{oj)D\vj) + X]{(o)D{(X) + D{oj)D\(o) . 

But we know that Z)(ca) = x{r)e~J'"'. Therefore, 

I X{(o) 1^ - I X,{(0) 1^ = 2Re{A;(co);c*(r)e^“^^} + D(co)x\r)e^^' 

= 2Re{ [A;(co) + D{co)2 - | x{r) |' 

= 2Re[ A'((u).t (r)c^“'^| — | x(r) ^ 

= 2-Ar/PS - I ,Y(r) 1^ . 



(231) 



(232) 



(233) 
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APPENDIX F. ON THE SECOND MOMENT OF ANALYTIC SIGNALS 



The purpose of this appendix is to proof the following theorem: 

Theorem: For an arbitrar}.' analytic random signal z{t), defined as 

z{t) = x(/) +yx(/) , 

where x{t) is the Hilbert transform of x(/), the following identity is satisfied; 

£{z(/7)z(s)} = 0 

Proof: 

E{z{n)z{s)] = E{ \_x{n) +jx{n)K-<s) +Jx(s)] } 

= £{x(«)a-(s)} +J£{x(n)45)} +y£W«).v(s)} - £{.v(rt)x(s)} . 

But, from the properties of the Hilbert transform, we have that 

= ^xx(t) 

Hence, 

£{z(/7)z(s)} = Rxxi’^ ~ •s) + jRxx{n — s) + jRxx{n — s) — Rxx[n — s) 

= 0 . 

Our proof is thus complete. 



( 234 ) 

( 235 ) 

( 236 ) 

( 237 ) 

( 238 ) 
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